SOME RECENT PROGRESS ON THE RESTRICTION 
CONJECTURE 

TERENCE TAO 

Abstract. We survey recent developments on the Restriction conjecture. 



1. Introduction 

The purpose of this paper is describe the state of progress on the restriction prob- 
lem in harmonic analysis, with an emphasis on the developments of the past decade 
or so on the Euclidean space version of these problems for spheres and other hy- 
persurfaces. As the field is quite large and has so many applications, it will be 
impossible to completely survey the field, but we will try to at least give the main 
ideas and developments in this area. 

The restriction problem are connected to many other conjectures, notably the 
Kakeya and Bochner-Riesz conjectures, as well as PDE conjectures such as the 
local smoothing conjecture. For reasons of space, we will not be able to discuss 
all these connections in detail, but refer the reader to [65], [13], [28], [58]; for the 
connection between restriction and Bochner-Riesz see [21], [15], [12], [6], [52], [35], 
etc. 



2. The restriction problem 

Fix n > 2; in our discussion all the constants C are allowed to depend 1 on n. 

We begin by discussing the restriction problem. Historically, this problem origi- 
nated by studying the Fourier transform of L p functions in Euclidean space R" for 
some n > 1, although it was later realized that this problem also arises naturally 
in other contexts, such as non-linear PDE and in the study of eigenfunctions of the 
Laplacian. 
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^The question on how quantifying the exact dependence of the constants here on the dimension 
n as 71 — + co is however an interesting problem, although to my knowledge there are not many 
results in this direction at present. 
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If / is an i 1 (R") function, then the Riemann-Lebesgue lemma implies that the 
Fourier transform /, defined by 



f(0 ■■= f e- 2 ™<f(x) dx 



is a continuous bounded function on R™ which vanishes at infinity. In particular, we 
can meaningfully restrict this function to any subset S of R™, creating a continuous 
bounded function f\s on S. 

On the other hand, if / is an arbitrary L 2 (R n ) function, then the Fourier transform 
/ can be any function in L 2 (R"), and in particular there is no meaningful way to 
restrict it to any set S of zero measure. 

Between these two extremes, one may ask what happens to the Fourier transform 
of a function / in L p (R n ), where 1 < p < 2. Certainly we do not expect the Fourier 
transform / to be continuous or bounded, and it is easy to construct examples of LP 
functions which have an infinite Fourier transform at one point. In fact, it is easy 
to create such a function which is infinite on an entire hyperplane; for instance, the 
function 

where x\ is the first co-ordinate of x, lies in LP for every p > 1, but has an infinite 
Fourier transform on every point on the hyperplane {£ € R" : £i =0}. 

On the other hand, from the Hausdorff- Young inequality we see that / lies in 
the Lebesgue space L p (R™), where l/p+ 1/p' = 1. Thus / can be meaningfully 
restricted to every set S of positive measure. 

This leaves open the question of what happens to sets S which have zero measure 
but which are not contained in hyperplanes. In 1967 Stein made the surprising 
discovery that when such sets contain sufficient "curvature", that one can indeed 
restrict the Fourier transform of LP functions for certain p > 1. This lead to the 
restriction problem [46]: for which sets S C R™ and which 1 < p < 2 can the 
Fourier transform of an L P (R") function be meaningfully restricted? 

There are of course infinitely many such sets to consider, but we shall focus our 
attention here on sets S which are hypersurfaces 2 , or compact subsets of hypersur- 
faces. In particular, we shall be interested 3 in the sphere 

Ssphere ~ {£ € R" : |£| = 1}, (1) 



2 For surfaces of lower dimension, see [17], [44], [38], [1]; for fractal sets in R,, sec [39], [45], [38]; 
for surfaces in finite field geometries, see [40]; for the restriction theory of the prime numbers, see 
[26]. 

3 It is easy to see, using the symmetries of the Fourier transform, that the restriction problem 
for a set S is unaffected by applying any translations or invertible linear transformations to the set 
S, so wc can place the sphere, paraboloid, and cone in their standard forms (1), (2), (3) without 
loss of generality. 
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the paraboloid 

S parab :={lieR n :Zn = l\i\ 2 }, (2) 

and the cone 

Scone :={£eR n :£„ = |£|}, (3) 

where £ = (£, £„) € R™^ 1 x R = R", and we always take n > 2 to avoid trivial sit- 
uations. These three surfaces are model examples of hypersurfaces with curvature 4 , 
though of course the cone differs from the sphere and paraboloid in that it has one 
vanishing principal curvature. These three hypersurfaces also enjoy a large group of 
symmetries (the orthogonal group, the parabolic scaling and Gallilean groups, and 
the Poincare group, respectively). Also, these three hypersurfaces are related via 
the Fourier transform to solutions to certain familiar partial differential equations, 
namely the Helmholtz equation, Schrodinger equation, and wave equation; however 
we will not focus on applications to PDE in this paper. 



3. Restriction estimates and extension estimates 

Let S be a compact subset (but with non-empty interior) of one of the above 
surfaces S sp here, Sparab, S con e- We endow S with a canonical measure da - for the 
sphere, this is surface measure, for the parabola, it is the pullback of the n — 1- 
dimcnsional Lebesgue measure d£ under the projection map £ i— > £, while for the 
cone the pullback of is the most natural measure (as it is Lorentz- invariant. 

In order to restrict the Fourier transform of an L p (R n ) function to 5, it will suffice 
to prove an a priori "restriction estimate" of the form 

\\f\s\\ms-4«) < c P , q ,s\\f\\ L ^Br) ( 4 ) 

for all Schwartz functions / and some 1 < q < oo, since one can then use density 
arguments to obtain a continuous restriction operator from L p (H n ) to L q (S;da) 
which extends the map / i— » f\s for Schwartz functions. When the set S has 
sufficient symmetry (e.g. if S is the sphere), this implication can in fact be reversed, 
using Stein's maximal principle [48]; if there is no bound of the form 5 (4), then one 
can construct functions / G L P (R") whose Fourier transform is infinite almost 
everywhere in S. 

We will tend to think of R™ as representing "physical space" , whose elements will 
be denoted names such as x and y, while S lives in "frequency space" , and whose 
elements will be denoted names such as ^ or w. For the PDE applications it is 
sometimes convenient to think of R n as a spacetime R" -1 x R := {(x,t) : x <G 
R™~ , t € R} (with the frequency space thus becoming spacetime frequency space 
{(£, t) : £ G R™ _1 , t e R}), but we will avoid doing so here. 



One could also consider cylinders such as S k ~ 1 X R n ~ k C R°, but it turns out that the 
restriction theory for these surfaces is identical to that of the sphere S fc_1 inside R. fc . 

5 Indeed, it suffices for the weak-type estimate from L p (R n ) to L p <°° (S; da) to fail. See [48]; 
similar ideas arise in the factorization theory of Nikishin and Pisier. 
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It is thus of interest to see for which sets S and which exponents p and g one has 
estimates of the form (4); henceforth we assume our functions / to be Schwartz. 
We denote by Rs{p — > q) the statement that (4) holds for all /. From our previous 
remarks we thus see that i?s(l — > g) holds for all 1 < q < oo, while J?s(2 — > 
g) fails for all 1 < q < oo; the interesting question is then what happens for 
intermediate values of p. If S is compact, then an estimate of the form Rs{p — * q) 
will automatically imply an estimate Rs(p — > g) for all p < p and g < g by the 
Sobolev and Holder inequalities. Thus the aim is to increase the size of p and g for 
which Rs(p — > g) holds by as much as possible. 

A simple duality argument shows that the estimate (4) is equivalent to the "exten- 
sion estimate" 

ll(^) V |lip'(R-) <C P , q ,s\\F\\ (5) 

for all smooth functions F on S, where (Fda) w is the inverse Fourier transform of 
the measure Fda: 

{Fda) v {x) := / F(Z)e***- x dtT(Z). 
Js 

Indeed, the equivalence of (4) and (5) follows from Parseval's identity 



/ (FdaY(x)f{x) dx = [ F(Ofk) da(Z) 

JK n Js 
and duality. If we use Rg(q' — » p 1 ) to denote the statement that the estimate (5) 
holds, then R* s (q' — » p') is thus equivalent to Rs{p — > g)- 

Note that because F is smooth, it is possible to use the principle of stationary 
phase (see e.g. [47]) to obtain asymptotics for (Fda) v . However, such asymptotics 
depend very much on the smooth norms of F, not just on the L q (S) norm, and 
so do not imply estimates of the form (5) (although they can be used to provide 
counterexamples). Thus one can think of extension estimates as a more general way 
than stationary phase to control oscillatory integrals, applicable in situations where 
the amplitude function F(£) has magnitude bounds but no smoothness properties. 

The extension formulation (5) also highlights the connection between this problem 
and partial differential equations. For instance, consider a solution u(t,x) : R x 
R" — > C to the free Schrodinger equation 

id t u + Au = 

with initial data w(0, x) = uq(x). This has the explicit solution 




: 2*<(**+2'rt|€l a ) fio (0 



or equivalently 

u = (Fda) v 

where da := d£5(r — 27r|£| 2 ) is (weighted) surface measure on the paraboloid 
{(t,£) € R x R" : r = 27r|£| 2 }, and F is the function iio(Q restricted to the 
paraboloid. Thus, estimates of the form Rg(g' — ► p') when S is the paraboloid 
in R x R" to control certain spacetime norms of solutions to the free Schrodinger 
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equation. Somewhat similar connections exist between the cone (3) (in R x R") 
and solutions to the wave equation uu — Aw = 0, or between the sphere (1) and 
solutions to the Helmholtz equation Am + u = 0. We will not pursue these connec- 
tions further here, but see for instance [49] and the (numerous) papers descended 
from that paper. (Some other connections between restriction estimates and PDE- 
type estimates are summarized in [62] and the references therein; for the Helmholtz 
equation, see for instance [4].) 

4. Necessary conditions 

We will use the extension formulation (5) to develop some necessary conditions in 
order for TH* s (q' — > p') to hold. First of all, by setting F = 1 we clearly see that we 
must have (da) y E L p (R n ) as a necessary condition. In the case of the sphere (1), 
the Fourier transform (d<r) v (x) decays in magnitude like {l + \x\)-^ n -^/ 2 ( as can be 
seen either by stationary phase, or by the asymptotics of Bessel functions), and so 
we obtain the necessary condition 6 p' > 2n/(n — 1), or cquivalently p < 2n/(n + 1). 
A similar computation gives the same constraint p' > 2n/(n — 1) for the paraboloid 
(2), while for the cone the asymptotics are slightly different, giving the condition 
p' > 2(n- l)/(n-2). 

Let F be a smooth function on S with an L°° norm of at most 1. Since Fda is 
pointwisc dominated by da, it seems intuitive that (Fdo~) v should be "smaller" 
than {d<j) y . Thus one should expect the above necessary conditions to in fact be 
sufficient to obtain the estimate R* s (<x> — > p 1 ). For completely general sets S, this 
assertion is essentially the Hardy-Littlewood majorant conjecture; it is true when 
p' is an even integer by direct calculation using Plancherel's theorem, but is false 
for other values of p' (a "logarithmic" failure was established by Bachelis in the 
1970s; a more recent "polynomial" failure has been established independently by 
Mockenhaupt and Schlag (private communication) and Green and Ruzsa (private 
communication). See [38] for further discussion). However, it may still be that the 
majorant conjecture is still true for "non-pathological" sets S such as the sphere, 
paraboloid, and cone. 

Another necessary condition comes from the Knapp example [63] , [49] . In the case 
of the sphere or paraboloid, we sketch the example as follows. Let R ^> 1. Then, 
by a Taylor expansion of the surface S around any interior point £o> we see that 
the surface S contains a "cap" k C S centered at £ of diameter 7 <~ 1/R and 
surface measure ~ i? - ^™" 1 ' which is contained inside a disk D of radius <~ 1/R and 
thickness ~ 1/-R 2 , oriented perpendicular to the unit normal of S at x . Let F 
be the characteristic function of this cap k (one can smooth F out if desired, but 
this does not affect the final necessary condition), and let T be the dual tube to 
the disk D, i.e. a tube centered at the origin of length ~ R 2 and thickness ~ R 

6 There does not seem to be any hope for any weak-type endpoint estimate at p' = 2n/(n — 1), 
see [5]. 

7 We use X < Y or X = 0(Y) to denote an estimate of the form X < CY where C depends on 
S, p, q, but not on functions such as /, F, or on parameters such as R. We use X ~ Y to denote 
the estimate X < Y < X. 
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oriented in the direction of the unit normal to S at xq. Then (Fdo~) v has magnitude 
~ o(K) <~ i?^™ -1 ) on a large portion of T (this is basically because for a large 
portion of points x in T, the phase function e 2mx '^ is essentially constant on K). 
In particular, we have 

\\{Fda) y \\ Lv , {I{r) > pf/p' R-^- 1 ) ~ ^-("-1)^+1)/?^ 
while we have 

WFWl'IS;*,) <\K\- 1/Q ' <R~ in - lW - 

Letting R — > oo, we thus see that we need the necessary condition 

n + 1 n — 1 

in order for -Rg(<7' — ► p') to hold. (In the case of compact subsets of the paraboloid 
with non-empty interior, one can obtain the same necessary condition using the 
parabolic scaling (£,£„) i— > (A£,A 2 £„). For the full (non-compact) paraboloid, one 
can improve this to I ^ = ^y^). In the case of the cone, we can lengthen the 

cap k in the null direction (so that it now has measure ~ i?~("~ 2 ) and lives in a 
"plate" of length ~ 1, width ~ 1/R and thickness 1/R 2 ), which eventually leads to 
the stronger necessary condition y < !1 -y-', as before, this can be strengthened to 
y = ^y 2 - if one is considering the full cone (3) and not just compact subsets of it 
with non-empty interior. 

One can formulate a Knapp counterexample for any smooth hypersurface; the nec- 
essary conditions obtained this way become stronger as the surface becomes flatter, 
and in the extreme case where the surface is infinitely flat (e.g. when it is a hyper- 
plane), there are no estimates. 

The restriction conjecture for the sphere, paraboloid, and cone then asserts that the 
above necessary conditions are in fact sufficient. In other words, for compact subsets 
of the sphere and paraboloid the conjecture asserts that Rg(q' — > p') holds when 
p' > 2n/(n—l) and ^yp- < while for compact subsets of the cone the conditions 
become p' > 2(n — l)/(n — 2) and y < (i.e. they match the numerology of the 
sphere and paraboloid in one lower dimension). This conjecture has been solved 
for the paraboloid and sphere in two dimensions, and for the cone in up to four 
dimensions; see Figures 1 and 2 for a more detailed summary of progress on this 
problem. The restriction problems for the three surfaces are related; the sharp 
restriction conjecture for the sphere would imply the sharp restriction estimate for 
the paraboloid, because one can parabolically rescale the sphere to approach the 
paraboloid; see [52]. Also, using the method of descent, one can link the restriction 
conjecture for the cone in R™ + with the restriction conjecture for the sphere or 
paraboloid in R™, although the connection here is not as tight (see [55] for some 
further discussion). 

5. Local restriction estimates 

We now begin discussing some of the tools used to prove the above restriction 
theorems. The first key idea is to reduce the study of global restriction theorems 
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Dimension 



Range of p and q 



= 2,p'>8 
>(p'/3)V>4 



Stein, 1967 

Zygmund, 1974 [71] (best possible) 



n = 3 



= 2,p' > 6 

>{p'/2)',p'>A 

>(p'/2)',p'>A 

p'>4-l 
>7/3;p'>4- A 

>(p'/2Y;p'>4-§ 

> 170/77; p' > 4-| 

> (y/2)';p'>4-| r 



>26/ll;p'>4- 
>(p'/2)';p'>4-f 
>(p , /2)'; J />3 



2(»+l) 
n-1 

2(rt+l) 
n-1 



2(rt+2) 



Stein 1967 

Tomas 1975 [63] 

Stein 1975; Sjolin - 1975 

Bourgain 1991 [6] 

Wolff 1995 [64] 

Moyua, Vargas, Vega 1996 [42] 

Tao, Vargas, Vega 1998 [60] 

Tao, Vargas, Vega 1998 [60] 

Tao, Vargas 2000 [61] 

Tao, Vargas 2000 [61] 

Tao 2003 [59] 

(conjectured) 



n > 3 



>((n-iy/(n + l))';p'> 
> ((n-iy/(n+l))';j/> 

, 2(n+l) _ 
,P -* „_1 tn 
/ > 2n 2 +n+6 



> 



n 2 +n— 1 
2nj+n+6 . 



Ji. „/ ^ grof+n+6 

n 2 +3n+l'i J n 2 +n-l 

> ((n-iy/(n + l))';p' > 
g'>((n-iy/(n+l))';p'> 



Tomas 1975 [63] 
Stein 1975 
Bourgain 1991 [6] 
Wolff 1995 [64] 

Moyua, Vargas, Vega 1996 [42] 

Tao 2003 [59] 

(conjectured) 



Figure 1. Known results on the restriction problem Rs(p — > (?) 
(or -R^g' — > p')) for the sphere and for compact subsets of the pa- 
raboloid. (For the whole paraboloid, restrict the above exponents 
to the range q> = (^f)'). 



Dimension 


Range of p and q 




n = 3 


q'>(p'/3)',p'>6 
q' > {p'/S)';p' >4 


Strichartz 1977 [49] 

Barcelo, 1985 [2] (best possible) 


n = 4 


g'>(p'/2)',p'>4 
g' > (p'/2)';p' > 3 


Strichartz 1977 [49] 

Wolff, 2000 [69] (best possible) 


n > 4 


q'>{{n-2)p'/n)',p'>^ 

? '>((n-2)p'/n)';p'>^ 

9 '>((n-2y/n)';o'>^ 


Strichartz 1977 [49] 
Wolff, 2000 [69] 
(conjectured) 



Figure 2. Known results on the restriction problem Rs(p — > ?) 
(or -Rlj^' —* p')) for compact subsets of the cone. (For the whole 
cone, restrict the above exponents to the range q' = { ^ n ~^ p )')■ 



(where the physical space variable is allowed to range over all of R n ), to that of 
local restriction theorems (where the physical space variable is constrained to lie in 
a ball) . 
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More precisely, for any exponents p, q, and any a > 0, let Rs{p — » 9; a) denote the 
statement that the localized restriction estimate 

H/|slU«(S;dCT) <Cp,q,S,aR a \\f\\LP(B(x ,R)) ( 6 ) 

holds for any radius R > 1, any ball B(xo, R) '■= {x G R" : |x — xo| < i?} of radius 
i?, and any test function / supported in B(xq,R). Note that the center xo of the 
ball is irrelevant since one can translate / by an arbitrary amount without affecting 
the magnitude of /. 

Observe that estimates for higher a immediately imply estimates for lower a (keep- 
ing p, q, S fixed). Also, the local estimate Rs{p — > q; 0) is clearly equivalent to the 
global estimate Rs(p — > q) by a sending R — > 00 and applying a limiting argument. 
Finally, it is easy to prove estimates of this type for very large a; for instance, for 
smooth compact hypersurfaces S one has the estimate Rs(p — > q;n/p') just from 
the Holder inequality 

1/(01 <ll/lli <c p i?»/" ll/ll Lp{B(x ,R)- 

Thus the aim is to lower the value of a from the trivial value of a = n/p' , toward 
the ultimate aim of a = 0, at least when p and g lie inside the conjectured range of 
the Restriction conjecture. (For other p and q, the canonical counterexamples will 
give some non-zero lower bound on a). 

By duality, the local restriction estimate Rs{p — > 9; a) is equivalent to the local 
extension estimate — > p'; a), which asserts that 

IK i?dcr ) V |lLp'(B( 2; o,-R)) - C 'p^:S,a^ Q ||-F 1 ||L<!'(S;d CT ) ( 7 ) 

for all smooth functions F on 5, all R>1, and all balls B(xo,R)- 

The uncertainty principle suggests that since the spatial variable has now been 
localized to scale R, the frequency variable can be safely blurred to scale 1/R. In 
the case where S is a smooth compact hypersurface, this is indeed correct; the 
estimate (6) is equivalent to the estimate 



\\f\\Li(N 1/R (S)) < Cp,q,S,aR a 1/q \\f\\LP(B(x ,R)) (8) 

holding for all test functions 8 / on B (xo , R) , where Ni/ R (S) is the 1 /^-neighborhood 
of S. To see how (6) implies (8), translate the surface S by 0(1/ R) and then av- 
erage (6) over all such translations; to see the converse implication, introduce a 
bump function ipB(x ,R) concentrated near B(xq,2R) which equals 1 on J5(xo,i?), 
and exploit the reproducing formula f — f * ipB(x ,R) to control / on S in terms of 

8 One can in fact remove the hypothesis that / is supported on a ball, and replace (8) with the 
corresponding global estimate where / ranges over R n , provided that we have the Hausdorff- Young 
condition q < p'. This is another manifestation of the uncertainty principle; it can be proven by 
using a smooth partition of unity to divide a global / £ L p (R n ) into functions in L p (B(xq, R)) 
for various balls B(xq,R) and applying (8) to each piece. To sum, one can subdivide Ni/ R (S) 
into cubes of size l/R and apply a local form of the Hausdorff- Young inequality on each cube. 
We omit the details. 
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/ on neighborhoods such as Ni/ R (S), exploiting the fact that ipB(x ,R) will decay 
rapidly away from B(0, l/R). 

Of course, (8) is equivalent by duality to the estimate 

\\G V \\lp' (B(x (h R)) ^ C ps .S, a R a 1 ^\\G\\ Lq ' (_ Nl/R ( S y) (9) 

for all smooth functions G supported on Ni/ R (S). By another application of the 
uncertainty principle (similar to the transference principle of Marcinkiewicz and 
Zygmund), this estimate is also equivalent to the discrete version 

II J29(Oe 2mx -H LP ' {B{xoM)) < C p , q ,s, a R a+{n - 1)/ci \\g\\ lq , (K) (10) 

where A is any maximal 1 / i?-separated subset of S, and g is any (discrete) function 
on A. 

From the formulation (8) and Plancherel's theorem, we immediately obtain the 
local restriction estimate i?s(2 — > 2; 1/2) for smooth compact hypersurfaces S; this 
estimate can also be obtained from the Agmon-Hormander estimate or from the 
frequency-localized version of the Sobolev trace lemma. 

To convert local restriction estimates into global ones, the key tool used is the decay 
of the Fourier transform (da) v . Indeed, suppose we have a decay estimate of the 
form 

\{daY{x)\<C/{l + \x\Y 

for some p > 0. Then the contributions to (4) arising from widely separated portions 
of space will be almost orthogonal. For instance, suppose R > 1 and B(xo,R) and 
B(x\,R) are two balls which are separated by at least a distance of R. Then if f 
and /i are supported on B(xo, R) and B(x\ 1 R) respectively, the Fourier transforms 
/o|s and /i|s will be almost orthogonal on S: 

\{fo\s,fl\s)L2(S;d(T)\ = \{fod<T,h) L 2(R n )\ 

= K/0 * (der) V ,/l)| < CR- p \\fo\\ L ^B(xo,R))\\h\\L^B(x u Ri%,l) 

since the convolution kernel (da) v has magnitude 0(R~ P ) when applied to dif- 
ferences of points in B(xo,R) and points in B(xi 1 R). This almost orthogonality 
asserts in some sense that distant balls do not interact much with each other, and 
so will allow us to reduce a global restriction estimate to a local one. 

One heuristic way to view (11) is as follows. This estimate is in some sense a 
"bilinear" version of the (false) estimate 

||/o|U 2 (S;da) < C , -R _p/2 ||/o||l 1 (B(xo^)); ( 12 ) 

this estimate is of course not true since the limit of the estimate as R — > oo is 
absurd, nevertheless it is "virtually" true in the sense that it implies the true 
estimate (11) by Cauchy-Schwarz. Note that (12) is just the (false) local restriction 
estimate i?s(l — » 2;—p/2). While this estimate is not true, it is true for certain 
interpolation purposes; for instance, by combining it with the Agmon-Hormander 
estimate Rs(2 — > 2; 1/2), one can obtain the Tomas-Stein estimate Rs C^l^ ~^ 2 )> 
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or more generally Rs(p — * 2) for all p < 2 ^ 2 1 - ) . This heuristic argument can be 
made rigorous by using orthogonality arguments such as the TT* method; see [63], 
[49], or [47]. In the particular cases of the sphere and paraboloid, the Tomas-Stein 

estimate yields Rs C^+P -> 2 ); for thc cone ' ^ y ields R s(^=2 ~* 2 )- Notc that 
this is consistent with the numerology supplied by the Knapp example from the 
previous section. 

The Tomas-Stein argument uses orthogonality on L 2 (S;da), and at first glance it 
thus seems that it can only be applied to obtain restriction theorems Rs{p — > q) 
when q = 2. However, it was observed by Bourgain [6], [12] that the same type 
of orthogonality arguments, exploiting the decay of the Fourier transform of da, 
can also be used for restriction theorems which are not i 2 -based, albeit with some 
inefficiencies due to the use of non-i 2 orthogonality estimates. These ideas were 
then extended in [42], [11], [52], [60], [61]; we cite two sample results below. 

Theorem 5.1. [6], [12], [42], [60], [61] Let p be as above. If R*(p -> q; a) holds 
for some p + 1 > aq, then we have R*(p — > q) whenever 



p+l — aq' q p(p + 1 — aq) 

Theorem 5.2. [52], [53] Let p be above. If Rs{p — > p; a.) holds for some p < 2 and 
0<a<l, then we have Rs{p — * q) whenever 

q p log(l/a) 



The second theorem in particular has the following consequence: if Rs{p — * 
is true for all e > 0, then Rs(p — > P — e) is also true for every e > 0. (Thc 
converse statement follows easily from interpolation) . Thus we can convert a local 
estimate with epsilon losses to a global estimate, where the epsilon loss has now been 
transferred to thc exponents. This type of "epsilon- removal lemma" is common in 
this theory, see [11], [61], [53] for some more examples. 

The above results are probably not optimal, however they do emphasize the point 
that one can study global restriction estimates via their local counterparts. 



6. Bilinear restriction estimates 



We now turn to another idea in the development of restriction theory - that of 
passing from the linear restriction and extension estimates to bilinear analogues. 

The original motivation of this theory was the "L 4 " or "bi-orthogonality" theory 
developed in such places as [23], [19], [16], [14], [37]. The basic idea is that ex- 
pressions such as || (Fda) v \\ LP 'r£i n \ can be calculated very explicitly when p' is an 
even integer, and especially when p' is equal to 4. Indeed, we have by Planchcrel's 
theorem that 



II(^) v IIl<(R») - ll(^) v (^)l^ 2 (ir ) = 11^*^11^ 
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Thus one can reduce a restriction estimate such as R* s (q' — > 4) to an estimate of 
the form 

\\Fda*Fda\\ L2{lr) <C q \\F\\l q , {s . da y, 

the point here is that there is no oscillation in this estimate (since there is no Fourier 
transform), and this estimate can be proven or disproven by more direct methods. 
For instance when S is the circle in R 2 , there is a logarithmic divergence in the 
above estimate, since da * da blows up like x / 2 on the circle {x G R 2 : |a;| = 2} 
of radius 2, however by introducing the localizing parameter R one can easily prove 
the modified estimate 

||G*G|| L2(ir) < C q (\ogR)^R-^\\G\\l HNi/R(s))l (13) 

for all R > 1 and all G supported on N\/ R (S)\ comparing this with (9) we ob- 
tain the local restriction estimate i?£(4 — > 4, e) for any e > 0, which (by use 
of epsilon- removal lemmas such as Theorem 5.2) proves the optimal range of re- 
striction estimates for the circle (first proven by Zygmund [71], by a more direct 
argument). 

Similar arguments also give the optimal restriction theory for the cone in three 
dimensions, see [2]. At first glance, this theory seems to be restricted to L 4 , since 
it relies on PlancherePs theorem. However, one can partially extend these ideas to 
other exponents L p . The main point is that the linear estimate 

ll(^) V |l iP '(R") <C p , q ,s\\F\\ Li (S;da) 
is equivalent, via squaring, to the quadratic estimate 
\\(Fday(Fday\\ LP , /2{Rn) <C p , q , s \\F\\ 

Li' (S-A<j)W F Wli' {S:da) 

which we can depolarize as the bilinear estimate 

\\(F 1 dar(F 2 dar\\ LP , /2{K n ) < C p , g ,s||Fi|| L9 ' (S;dff) ||F 2 ||^ (S;d(7) . (14) 

In such an estimate, the worst case typically occurs when iq and F 2 are both 
concentrated in the same small "cap" in S: this is what happens in the Knapp 
example, for instance. 

The strategy of the bilinear approach to restriction theory is to rewrite the linear 
estimate (5) as the bilinear estimate (14), which in turn is a special case of a more 
general estimate of the form 

\\(F 1 da i r(F 2 da 2 r\\ LP , /2(Kn) < C^sJ^iH Li' (S 2 ;da 2 )> 

(15) 

for arbitrary pairs of smooth compact hypersurfaces Si , S 2 with surface measures 
dai, da 2 respectively, and all smooth iq, F 2 supported on Si and S 2 . We let 
R* Si g 2 (q' xi;'^ p'/2) denote the statement that the estimate (15) holds. Then by 
the above discussion, R* s {q' — ► p 1 ) is equivalent to R* s s (q' x q' — > p'/2). Thus linear 
restriction estimates are special cases of bilinear estimates. However, there are 
bilinear estimates that cannot be derived directly from linear ones. For instance, 
let 5*1 := {(£i,0) : £i G R} and £2 := {(O,^) : £2 G R} denote the x and y axes 
in R 2 . Then we have (Fidai) v (x, y) = Fi(x) and (F 2 da 2 ) v (x,y) = F 2 (y), and so 
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there are no global restriction estimates of the form R s (q' — ► p') or Rg 2 (q' — * p') 
unless p' = oo, since the Fourier transforms do not decay at infinity. However, since 

(F 1 da 1 ) v (F 2 da 2 ) v (x,y) = F 1 (x)F 2 (y), 

we see from the one-dimensional Plancherel theorem that we have the bilinear re- 
striction estimate R Si S2 (2 x 2 — > 2). Note however that the symmetrized analogues 
Rg S] (2x2^2) and R* S2 Sa (2 x 2 — > 2) are false. Thus the bilinear estimate ex- 
ploits the transversality of Si and S2 . 

A higher-dimensional analogue of this estimate is known: if Si and S2 arc two 
smooth compact hypersurfaces which are transverse in the sense that the set of 
unit normals of Si are separated by some non-zero distance from the set of unit 
normals of S 2 , then we have R s Sa (2 x 2 — > 2). This can be easily seen by using 
Plancherel to convert the bilinear restriction estimate to a bilinear convolution 
estimate 

HCFidtri) * (F 2 da 2 )\\ L2(Ir) < C p ^g u g 2 \\Fi\\ L 2 {Sl . dai) \\F 2 \\ L 2 {S2 . drT2) , 
and then using the Cauchy-Schwarz estimate 

\\(Fidai) * {F 2 da 2 )\\ L2(£C) < ||(|Fi| 2 dtri) * (\F 2 \ 2 da 2 )\\ ^^Wdtn) * da 2 \\ Loo(£C) 

and using transversality to bound the second factor. Generalizations of these "bi- 
linear L 2 " estimates have arisen in recent work in non-linear evolution equations 
(starting with the work of Bourgain [10] and Klainerman-Machcdon [30] and con- 
tinued by numerous authors, see for instance [29]) and are especially useful for 
handling non-linearities which contain derivatives arranged to create a "null form" , 
but we will not pursue this matter here, and refer the reader instead to [25], [24], 
[57]. There has been also some work in generalizing these bilinear estimates to 
weighted settings, see [3]. 

Now let Si and S 2 be two compact transverse subsets of the sphere or paraboloid. Of 
course, any restriction theorem R* s (q' —* p') for the sphere or paraboloid will imply 
a bilinear restriction theorem R* Si S2 (q' x q' — > p'/2) for the pair Si, S 2 . However, 
the transversality allows us to prove more estimates in the bilinear setting than 
the linear one; we have already seen the estimate R s s (2x2^2), whereas the 
linear restriction estimate R* s (2 — ► 4) is only true in three and higher dimensions. 
One reason for this is that there is no exact analogue of the Knapp example in 
the transverse bilinear setting. Indeed, the best necessary conditions known on 
R*S 1 ,S 2 ( ( l' x l' ~~ * P'/ 2 ) arc tnat 

2n n + 2 n n + 2 n — 2 

P>— 7; ~^ + -<n; — ; - + — f -<n-l; 16 
n + 1 p' q p' q' 

see [60] or [24], where one develops bilinear analogues of the Knapp examples. This 

is somewhat less stringent than the corresponding conditions 

2n n + 1 n — 1 . . 

n + 1 p' q 

for the linear problem R* s (q' — > p'). The bilinear restriction conjecture asserts 
that the necessary conditions (16) are in fact sharp. This conjecture is still open 
except in dimension two, but recently it has been shown that (up to endpoints) it 
is equivalent to the usual restriction conjecture for the sphere and paraboloid. 
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Up until now, we have viewed bilinear restriction estimates as being more complex 
generalizations of linear restriction estimates, which seems to offer no incentive to 
study the bilinear estimates until the linear ones are settled. However, it turns out 
that one can use the bilinear estimates to go back and deduce new linear estimates, 
and indeed all the recent progress on the restriction problem has been obtained in 
this manner. The key observation is that one can perform a Whitney decomposition 
of the product manifold S x S around the diagonal A := {(£,£) : £ G S} so that 
S x S\A decomposes as the disjoint union of sets of the form 5i x S2, where Si, S2 
are disjoint subsets of S whose separation is comparable to their diameter. This 
allows one to obtain bilinear restriction estimates of the form R* s s (p' xp'^ <z'/2) 
(and hence R s (p' — > q 1 )) from estimates of the form Rg lv s 2 (p' x p' ~~ * q'/2), using 
some rescaling and orthogonality estimates to sum up (and discarding the diagonal 
A, which is of measure zero); see [60] for more details. Of course one cannot hope to 
have an unconditional implication of the form R*s 1 .s 2 (p' x p'^ <l'/2) R*s(p' ~* 
q'), since the necessary conditions (16) for the former are weaker than those (17) 
for the latter; however, we can do the next best thing: 

Theorem 6.1. [60] Let p, q obey the necessary conditions (17), and suppose that 
R* Si S2 (p' xp' — > q') is true for all (p, q) in an open neighborhood of (p, q), and some 
pair Si, S2 of compact transverse subsets of the paraboloid. Then R* s {p' — > q') is 
true. 

A similar result is true for the sphere, except that one must make Si and S2 subsets 
of a certain parabolically rescaled version of the sphere; see [60] for more details. 

The above theorem (and ones like it) allow one to pass freely back and forth between 
linear and (transverse) bilinear restriction estimates. For instance, this theorem can 
be used to provide an alternative proof of Zygmund's estimate (which asserts in 
particular that R* s (4 — > 4 + e) when S is the unit circle) from the basic estimate 
Rg S2 (2 x 2 — ► 2) for transverse sets. Although the bilinear estimates appear 
more complicated, they are in fact easier to analyze because they consist purely 
of transverse interactions, excluding the parallel interactions which often cause the 
most trouble (cf. the Knapp example). 

One can of course formulate local bilinear restriction estimates R s Sa (p' xp'^ 
q';a), which assert that 

||(i ;i idcri) v (F2dcr2) v || L p'/2( B ( a;o ,_ R )) < C p . q . Sl ,s 2 ,aR a \\Fi\\ Lq > (Si . dai) \\F 2 \\ Lq > (S2 . drT2) . 

One can of course reformulate these estimates using the uncertainty principle in 
a similar way to before, though some reformulations are not available because the 
notion of dualizing a bilinear estimate becomes difficult to use. There are also 
bilinear "cpsilon-removal" lemmas available; for instance, we have 

Theorem 6.2. [11], [61] Let Si, S2 be compact surfaces obeying some decay esti- 
mate 

\{do-i) y {x)\,\{do-2)\x)\<C/{l + \x\y 
for some p > 0. Suppose we have the local bilinear restriction estimate R* Si Ss (2 x 
2 — ► q, e) for all e > 0. Then we have the global bilinear restriction estimate 
R*s u s 2 ( 2 x 2 1 + £ ) f° r al1 £ > °- 
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More quantitative versions of this estimate have been proven, see e.g. [61], Lemma 
2.4. See also [31] for a more PDE-based approach to this epsilon-removal lemma. 

The bilinear estimate R s g 2 (2 x 2 — > 2) holds for all surfaces Si, S2 which are 
transverse. If both Si and S 2 are flat, then this estimate is sharp; however one can 
improve this estimate slightly when Si and S2 have some curvature. For instance, if 
Si and S2 are transverse subsets of the paraboloid in R™, then we have Rg S2 (p x 
p — > 2) for all p > 3 ^" 2 , see [60] . To see why we should gain over the p = 2 estimate, 
consider the following. Using Plancherel, we can rewrite R* s s (2 x 2 — > 2) as the 
bilinear convolution estimate 

\\(Fidai) * (F 2 da 2 )\\l 2(Rn) < q|Fi||£ 2(so ||F 2 ||£ w 

Let us suppose for the moment that we are in a model case, where the Fj , j = 1, 2 
are characteristic functions, for the sets {(£j,§|£j| 2 ) : £j € Clj} for some disjoint 
bounded open subsets Oi, Q 2 of R"; the right-hand side is thus CjOi | IO2I - Then 
by discarding some Jacobian factors (which are harmless due to the transversality) , 
the left-hand side is essentially the volume of the 3n — 1-dimcnsional set 

{(LvirirU e fiixfi2xfl 1 xfl 2 :? 1 +e 2 = e 3 +? 4 ; Hi\ 2 + \i 2 \ 2 = Hf + liil 2 }- 

The two constraints imply that £ 2 and £ , £ 4 from the opposing diagonals 
of a rectangle. In particular, £ g lies on the hyperplane 7r(^ 1 ,^ 4 ) containing ^ and 
orthogonal to ^ — £ , and then £ 2 can be recovered from the other three frequencies 
by the formula £ 2 = £ 3 + ^ — ^. Thus, by Fubini's theorem, the volume of the 
above set is bounded above by 




(discarding the constraint £2 € ^2)- Since is bounded, we may make the very 
crude estimate 

1^2 1-17^,^)1 <C, (18) 

from which the desired bound of C|f2i||f22| follows. 

The estimate (18) can of course be improved when ^2 is small, using the standard 
L' p bounds for the Radon transform. This is made rigorous in [42], [43], [60], 
culminating in the above-mentioned bilinear restriction estimate R% 1 s 2 (P x P ~~ * ^) 
for all p > 3 ^" 2 . This issue of exploiting the possible gain over (18) also arises in 
some recent developments [59] in bilinear restriction theory, which we shall return 
to later. 

The latter two conditions of (17) meet when q' = 2, when they assert that R s S2 (2x 

2 — > q) for all q > 2 ( n + 2 > . This was first conjectured by Machedon and Klainer- 
man for both the paraboloid and cone. Despite the original restriction conjecture 
remaining open, this conjecture has been completely solved for the cone and solved 
except for an endpoint for the paraboloid; see Figures 3, 4. We shall discuss this 
recent progress in the next few sections. 
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Dimension 


Range of g 




n > 2 


q > 2 


Planchercl + Cauchy-Schwarz 


n > 3 


y — n—2 


Strichartz 1977 [49] 


n = 2 


?> 2 -2i8 


Bourgain 1995 [11] 


n = 2 


?> 2 -lfl 


Tao, Vargas 2000 [61] 


n > 2 


^ 2(n+2) 
y n 


Wolff, 2000 [69] 


n > 2 


a ^ 2(n+2) 
y — n 


Tao, 2001 [55] (best possible) 



Figure 3. Known results on the bilinear restriction problem 
i?s lX s 2 (2 x 2 — > qr), for transverse compact subsets of the cone. 



Dimension 


Range of g 




n > 2 
n > 3 
n = 2 
n = 2 
n > 2 
n > 2 


g > 2 

^ 2(n+2') 

" n 

^ 2(n+2) 

" — n 


Planchcrel + Cauchy-Schwarz 

Strichartz 1977 [49] 

Tao, Vargas, Vega 1998 [60] 

Tao, Vargas 2000 [61] 

Tao, 2003 [59] 

(conjectured) 



Figure 4. Known results on the bilinear restriction problem 
-Rsixs 2 (2 x 2 — > q), for transverse compact subsets of the parabo- 
loid. 



7. The wave packet decomposition 

We have discussed two of the tools in the modern theory of restriction estimates: 
the reduction to local estimates, and the reduction to bilinear estimates. We now 
turn to a third key technique: the introduction of wave packets. 

For sake of illustration, suppose we wish to prove the local restriction estimate 
Rs(p — > 1; ot) where S is the sphere (1); the exponent 1 can of course be changed, 
but this does not significantly alter the argument sketched below (except that the 
estimates on certain coefficients ct will change). We use the formulation (9), fixing 
x = 0, thus we have to prove 

11*2 \\lp'(B(0,R)) ~ 

for R > 1 all bounded functions G on N 1 / R {S). Henceforth we use X < Y or X = 
0(Y) to denote the estimate X < CY for some C depending on such parameters 
as n, p, a, while we call a function "bounded" when it has an L°° norm of 0(1). 

Fix R and G, and observe that the annular region Ni/r(S) can be divided into 
~ ij( n_1 )/ 2 finitely overlapping disks k of width <~ 1/VR and thickness 1/R. If G 
is a function on Ni/ R (S), we can thus use a partition of unity to divide G — J2 K G K , 
where each G K is a bounded function supported on one of these disks G K . Our task 
is thus to show that 

II X! ^Wlp' (B(0,R)) ~ 

K 
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The question then arises as to what G^ looks like. We first consider some examples. 
Suppose that the disk G K is centered at a point uo R e S n_1 , which by the geometry 
of the sphere implies that u K is also essentially the normal to the disk n. If G K is 
a bump function adapted to n, then by duality G^ would be concentrated on the 
RxR 1 / 2 tube 

T Kfi ■= {x G B(0, R) : tt u ±x = 0{R^ 2 )}, 
where 7r w i is the orthogonal projection onto the hyperplane uijr := {x G R™ : 
x ■ uj k = 0}. Indeed, since k has volume roughly R~( n+1 ^ 2 , we would expect to 
equal a function ipT Ki0 of the form 

V>t„» - R-W 2 e 2 ™*-*<l>T Hi0 , (19) 

where 0t k>o is a Schwartz function adapted to the tube T Kj0 which has size 0(1) on 
this tube and is rapidly decreasing away from this tube. We call the function tpT Kt0 
a wave packet adapted to the tube T K>0 ; this object has already essentially come up 
in the discussion of the Knapp example. 

What happens when G K is not a bump function adapted to k! First suppose that 
G K is a modulated bump function, more precisely suppose 

G K (£)=e- 2mxa <G K (£) 

where G K is a bump function adapted to k, and xq is an element of the hyperplane 
u)j[. Then by the above discussion, G y K will be concentrated on the R x R 1 ' 2 tube 

T KtXo ■— T K fi + Xo, 

indeed we have 

for some Schwartz function tpT K xo adapted to T K , Xo . (One could also modulate G K 
in the direction parallel to u> K instead of in the perpendicular directions, but this 
either has a negligible effect on the Fourier transform on the ball -6(0, i?), or else 
makes the Fourier transform much smaller, depending on how much modulation is 
applied) . 

Thus one can make G^ resemble a wave packet ?pT for any tube T oriented in the 
direction u K . In the general situation, where G K is a bounded function on k, then 
one can perform a Fourier series decomposition in the directions perpendicular to uj k 
to essentially decompose G K as an l 2 -average of modulated bump functions. (The 
behavior in the direction parallel to ui K , which only extends for a distance 0(1/R) 
is essentially irrelevant, thanks to the localization of physical space to B(0, R) and 
the uncertainty principle). Thus we can write 

T//w K 

where T ranges over a finitely overlapping collection of R x ^/R~ tubes in B(0, R) 
oriented in the direction lo Ki ipT is a wave packet adapted to T, and ct is a collection 
of scalars with the L 2 normalization condition X)t//w \ c t\ 2 ^5 1- O nc can then 
expand the original Fourier transform G v as 

G v = ^2 C T lpT 
T 
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where T now ranges over a separated 9 collection of tubes in B(0, R), and lot denotes 
the direction of T. 

This heuristic decomposition is an example of what is known as the wave packet 
decomposition of G v . Versions of this decomposition in the context of the restriction 
problem (or the closely related Bochner-Riesz problem) first appeared in [18], [19], 
[23], [22], and was then later developed in [6], [12], [42], [43], [60], [61], [69], [55]; this 
method also can be applied to related problems such as local smoothing or Bochner- 
Riesz, see for instance [68], [70]. The wave packet decomposition reduces the study 
of restriction estimates to that of proving estimates on the linear superpositions of 
wave packets 

\\^2c t ^t\\ Lp > ( b(om))- (20) 

T 

Note that the wave packets tpr have two main features; one at coarse scales » ^/R~ 
and one at fine scales <C V~R- At coarse scales, the wave packet is localized to a 
relatively thin tube of width \f~R~. At fine scales, the wave packet oscillates at a 
fixed frequency lot- Note that the coarse scale behavior and fine scale behavior 
are linked, because the direction of the tube at coarse scales is exactly the same as 
the frequency of the oscillation at fine scales. The issue is then how to co-ordinate 
these two aspects - localization at coarse scales, and oscillations at fine scales - of 
wave packets in order to estimate (20) efficiently. 

The first strategy for estimating these superpositions of wave packets is due to 
Cordoba [18], [19], in which the idea is to estimate the oscillatory sum by the 
associated square function 

|lEl^| 2 ) 1/2 |lLp'( B (o, fl ))- (21) 

T 

The point of doing so is that all the fine-scale oscillation has been removed from 
this problem, leaving only the coarse scale localizations to tubes. There is still of 
course the problem of estimating this non-oscillatory square function; this problem 
is essentially equivalent 10 to the problem of estimating the Kakeya maximal func- 
tion, which is another important problem in harmonic analysis, but one which we 
will not discuss in detail here. (See however [64], [13], [28]). 

Now we discuss how to estimate the oscillatory sum (20) by the square function (21). 
When p' = 2, or when n — 2 and p 1 = 4, one can bound the former by the latter 
by direct orthogonality (or bi-orthogonality) arguments, however these arguments 
do not work for other values of p' . Nevertheless, it was observed by Bourgain [6], 
[12] that one can still obtain some control of (20) by (21) in these cases, but with 
a loss of some powers of R. The idea is to break the ball B(0,R) up into cubes q of 
size VR. On such "fine-scale" cubes, a wave packet ipT = R~^ n ~ v> / 2 e 2nVjl ' I " x (j)T has 
essentially constant magnitude; to (over-)simplify the discussion, let us suppose 

9 This means that any two tubes T, T 1 in this collection either have directions differing by at 
least 1/i? 1 / 2 , or are parallel and are separated spatially by at least R 1 / 2 . 

10 Conversely, one must resolve the Kakeya conjectures in order to fully resolve the restriction 
problem, because one can use randomization arguments to show that any bound on (20) implies 
a comparable bound on (21). See e.g. [5]. 
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that 4>t is equal to 1 on q if q C T and <f>T vanishes on q otherwise. Then the 
portion of (20) coming from q is 

p— (n— l)/2|i/ \ 1 „27riwr-x|| 
if |K 2^ CTe \\LP'(B(0,R)) 

T:TDq 

while the corresponding portion of (21) is essentially 

R -(n-l)/2 R n/2p' { J- |c T | 2 ) 1 /2_ 
T:TDq 

One can then control the former expression by the latter using discrete restriction 
estimates 11 of the type (10), although the various powers of R which accumulate 
when doing so do not necessarily all cancel, and so this method of estimation can 
cause some losses 12 . 



By combining these observations with some non-trivial progress on the Kakeya 
maximal function conjecture, Bourgain [6], [12] was able to obtain certain improve- 
ments to the Tomas-Stein estimate (see Figure (4)). Further progress was made 
by Wolff [64], who improved the Kakeya estimate used in Bourgain's argument. 
By introducing bilinear (or L 4 ) methods to these arguments, further improvements 
were obtained in [42], [60], [11] [61]; one feature of these bilinear methods is that 
they could now be applied to the cone as well as the sphere or paraboloid. 

These methods, however, did not obtain sharp ranges of exponents, for a variety of 
technical reasons. The next breakthrough was achieved by Wolff [69], who solved 
(up to endpoints) the Machcdon-Klaincrman conjecture for cones, by employing 
one additional technique - that of induction on scales, which we discuss next. 



8. Induction on scales 



The strategy to prove a local restriction estimate at a scale R in the previous 
section can be summed up as follows: starting with a function G v , decompose it 
into wave packets supported on y/R x R tubes. Designating scales greater than 
as coarse, and scales less than Vi? as fine, we use oscillatory estimates such as local 
restriction estimates on fine scales, and Kakeya type estimates at coarse scales, in 
order to obtain the desired control on G v . 

This type of argument works particularly well when G is a Knapp example sup- 
ported on a disk of radius Rr 1 / 2 , so that G v is essentially a single wave packet. 
However, it becomes inefficient when G is a Knapp example spread out over a wider 

11 It is intriguing that one uses local restriction estimates at scale \/~R, together with some 
Kakeya information, to obtain local restriction estimates at scale R. This suggests a possible 
"bootstrap" approach where one could continually improve restriction estimates via iteration. 
Some partial iteration methods to this effect can be found in [11], [60], [61]; another example of 
this idea occurs in the induction-on-scales approach discussed in the next section. 

12 It is conjectured that in any dimension n > 2, that one can estimate (20) by (21) in the 
endpoint case p = 2n/(n + 1), with at most an epsilon loss R £ ; this, together with the so- 
called Kakeya maximal function conjecture, would imply the restriction conjecture. However, it 
is nowhere near solved at present, except when n = 2, and is likely to be a harder problem than 
the restriction problem itself. 
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region, e.g. a cap-type region of radius r -1 / 2 for some 1 < r < R. Then G v is 
concentrated on a much smaller set than a single wave packet - indeed, it is (some- 
what) localized to a r x y/r tube instead of an R x \f~R~ tube - but the wave packet 
decomposition requires that one decompose G v as the sum of much larger objects. 
This is a rather inefficient decomposition, and one which leads to significant losses 
in the estimates. 

The difficulty here is that the wave packet decomposition is chosen in advance, 
instead of being adapted to the particular function G being investigated. In par- 
ticular, if it turns out that G v is concentrating in a much smaller region, say a ball 
B(xo, r), then one should replace the rather coarse R x >/R wave packet decompo- 
sition by a finer one, in this case a r x y/r decomposition. 

Of course, the difficulty is that it would be incredibly complicated to actually try 
to construct such an adaptive wave packet decomposition, recursively passing from 
coarser scales to finer scales. Fortunately, a way out of this complexity was discov- 
ered by Wolff [69] - which is to hide all this recursive complexity in an induction 
hypothesis, which we now refer to as an induction on scales argument. Using this 
new idea, Wolff was able to obtain a nearly-sharp bilinear restriction estimate for 
the cone, namely that R* Si & (2x2^g) is true for all transverse compact subsets 
S\ , S2 of the cone, and all q > ^J 2 - . 

We now describe, rather informally, the idea of the argument; for a more rigorous 
presentation see [69], [55], [56], [59], [31]. Suppose inductively that we already have 
some local estimate of the form R* s {q' — > p'; a); we will now try to use this estimate 
to prove a better estimate of the form R$(q' —* p'; a — e) for some e > depending 
on a (in what follows, the value of e will vary from line to line). Iterating this, we 
will eventually be able to obtain the estimate R* s (q' — > p', e) for any e > 0, at which 
point we can use epsilon removal lemmas to obtain a global restriction estimate. 

We still have to obtain the estimate R*(q' — > p'\ a — e) from the inductive hypothesis 
R*(q' — > p';a). We first describe a somewhat oversimplified version of the main 
idea as follows. We have to prove an estimate of the form 

\\( Fda ) V \\Lp'(B(0,R)) - C P:q:S,aR a ~ e \\F\\ Lq > (s) 

for some F on S and R > 1, which we now fix. Introduce the scale r := i? 1_e , 
which is slightly smaller than R. Then by the induction hypothesis R*(q' — ► p'; a) 
applied to scale r, we have 

IK^ cr ) V |lLp'(S( a ;o,r)) ^ C P,q,S,c,R a ~ S \\F\\ Lq > {s) 

for any ball B(xo,r). Thus we can already prove the desired estimate on smaller 
balls B(xo,r). More generally, we can prove 

\\( Fda Y\\Lp'{[j J B(x J .T)) ^ Cp,q,S,aR a ~ S \\F\\ Lq > (s) 

on any union IJ^ B(xj, r) of smaller balls, as long as the number of balls involved 
is not too large (e.g. at most 0((\ogR) c ) for some absolute constant C). 
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As a rough first approximation, the idea of Wolff is to identify the "bad" balls 
B(xj,r) on which the function (Fd<r) v "concentrates"; the choice of these balls 
will of course depend on F. These balls can be dealt with using the induction 
hypothesis, and it then remains to verify the restriction estimate on the exterior of 
these bad balls: 

\\(Fd(j) V \\ LP >( B ( iR )_\J. B ( Xj ,r)) ^ Cp,q,S,aR a ^ \\F\\ Li' (S)' 

The above description of Wolff's argument was something of an oversimplification 
for two reasons; firstly, Wolff is working in the bilinear setting rather than the linear 
setting, and secondly the balls B(xj,r) turn out to depend not only on the original 
function F, but of the wave packet decomposition associated to F. Let us ignore 
the first reason for the moment, and clarify the second. On the ball B(0,R), one 
can obtain a wave packet decomposition of the form 

(Fda) v (x) = ^TcWt. 

T 

Because the argument of Wolff dealt with the cone, the wave packet decomposition 
here is slightly different from that discussed in the previous section, in two respects: 
firstly, the tubes T are oriented on "light rays" normal to the cone S instead of 
pointing in general directions, and secondly the internal structure of the wave packet 
ipT is more interesting than just the product of a plane wave and a bump function, 
being decomposable into "plates" . We however will gloss over this technical issue. 

For simplicity, let us suppose that the constants ct behave like a characteristic 
function; more precisely, there is some collection T of tubes such that ct — c for 
T e T and ct — otherwise. (The general case can be reduced to this case via a 
dyadic pigeonholing argument, which costs a relatively small factor of logi?). Then 
we have 

(Fda) v (x) =cJ2 V"r(z). 
tgT 

The idea now is to allow each wave packet Vt to be able to "exclude" a single ball 
Bt of the slightly smaller radius r. In other words, one divides (Fd<r) v into two 
pieces, a "localized" piece 

c ^t{x)xb t {x) 
tgT 

and the "global" piece 

c ^ iPt(x)(1 - Xb t {x)). 
TeT 

One then tries to control the localized piece using the induction hypothesis, and 
then handle the non-localized piece using the strategy of the previous section. 

In the linear setting, this strategy does not quite work, because the localized pieces 
cannot be adequately controlled by the induction hypothesis. However, in the 
bilinear setting, when one is trying to prove an estimate of the form 

||(FicZeri) v (F 2 d<T 2 y '|| LP > /2 (B (o,i?)) ^ C P,q,s,aR a ~ e \\Fi\\ Lq > (s) \\F 2 || L ,'(S) 



RECENT PROGRESS ON RESTRICTION 



21 



then one can decompose 



Tj eT : , 



for j = 1, 2, and allow each tube Tj to exclude a single 13 ball B Tj of radius r. We 
can then split the bilinear expression 



Tl eT lT2 eT 2 

The local piece turns out to be easily controllable by the inductive hypothesis, so 
it remains to control the global piece. 

The key point is to prevent too many of the tubes Ti and T 2 from interacting with 
each other. This is done by selecting the balls Bt 11 Bt 2 strategically. Roughly 
speaking, for each tube Ti, we choose Bt x to be the ball which contains as many 
intersections of the form T\ n T 2 as possible; the ball Bt 2 is chosen similarly. The 
effect of this choice is that any point x which lies in a large number of tubes in T\ 
and in T 2 simultaneously, is likely to be placed primarily in the local part of the 
bilinear expression, and not in the global part. 

With this choice of the excluding balls Bt x , Bt 2 , Wolff was able to obtain satis- 
factory control on the number of times tubes Ti from Ti would intersect tubes T 2 
from T 2 . The key geometric observation is as follows. Suppose that many tubes T\ 
in Ti were going through a common point xq; since the tubes T± are constrained 
to be oriented along light rays, these tubes must then align on a "light cone" . 

Now consider a tube T 2 from T 2 ; this tube is of course transverse to all the tubes 
T\ considered above, and furthermore is transverse to the light cone that the tubes 
T\ lie on. It can either pass near x , or stay far away from x - In the first case it 
turns out that the joint contribution of the tubes T± and T 2 will largely lie in the 
local part of the bilinear expression and thus be manageable. In the second case 
we see from transversality that the tube T 2 can only intersect a small number of 
tubes Ti. 

Thus if there is too much intersection among tubes T\ in Ti, then there will be 
fairly sparse interection between those tubes T\ and tubes T 2 in T 2 . This geo- 
metric fact was exploited via combinatorial arguments in [69] , and when combined 
with some local L 2 arguments from [37] to handle the fine scale oscillations, and 

is A 

dually, in Wolff's argument there are 0((log R) c ) such balls excluded, but this is a minor 
technical detail. 
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the induction on scales argument, was able to obtain the near-optimal bilinear re- 
striction theorem R* Si S2 (2 x 2 — > q) for q > . (The endpoint q = ^±2 to the 
Machedon-Klainerman conjecture was then obtained in [55] by refining the above 
argument). 



9. Adapting Wolff's argument to the paraboloid 

The above argument of Wolff [69] , which yielded the optimal bilinear L 2 restriction 
theorem for the cone, relied on a key fact about the cone: all the tubes passing 
through a common point x n , were restricted to lie on a hypersurface (specifically, 
the cone with vertex at xq). This property does not hold for the paraboloid, since in 
this setting the tubes can point in arbitrary directions. Nevertheless, it is possible 
to recover this hypersurface property by exploiting a little more structure at fine 
scales, and more precisely by squeezing one "dimension" of gain out of (18), thus 
obtaining the optimal bilinear L 2 restriction theorem for the paraboloid (and in fact 
also for the sphere, by a slight modification of the argument); this was achieved in 
[59]. We sketch the main idea of that paper here. 

As in the last section, we can reduce matters to estimating a quantity such as 

II Y i'T 1 1pT 2 0--XBT 1 nBT 2 )\\L^B(,0,R)) 

T ie T lT2 eT 2 

for some 1 < q < 2. It turns out in this case that one can obtain good bounds simply 
by interpolating between L 1 and L 2 bounds. The L 1 bound is fairly trivial (using 
Cauchy-Schwarz to reduce to L 2 bounds on X^t gT ■> wn i cn can be handled by 
orthogonality arguments), so we turn to problem of estimating the L 2 quantity: 

II Y V'TiV'T^l - XB Tl nB T2 )\\ 2 L 2( B (0,R)y 

TigTi T 2 eT 2 

As is customary, we subdivide the large ball B(0, R) into cubes q of size VR. The 
contribution of each cube q is 

II Y Y ^!^T 2 (1 - XB Tl nB T2 )\\ 2 L 2 {q y 

T 1 eT 1 T 2 aT 2 

Roughly speaking, we only need to consider pairs Ti , T 2 of tubes which pass through 
q (because of the localization of tpTt and tpT 2 , and such that q is not contained in 
both Bt x and Bt 2 - For sake of argument, suppose that we only consider the terms 
where q (£_ B Tl . Then we can rewrite the above expression as 

II Y Y ^T 2 \\h (q) 

T.eT'^g) t 2 gT 2 ( 9 ) 

where T 2 (q) denotes all the tubes T 2 in T 2 which intersect q, and (q) denotes all 
the tubes T\ in Ti which intersect q and for which q (£_ Bt x ■ Note that the tubes 
in T[(q) must point in essentially different directions (since they all go through q, 
and are essentially distinct tubes), and similarly for T 2 (g). 
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The function X^TieT' (<?) behaves roughly like the function -^{xn'^dciY , where 
(q) is the subset of the paraboloid whose unit normals lie within 1/R of the di- 
rections of one of the tubes in T[(q). This can be seen by recalling the origin of 
these wave packets -0Ti , as Fourier transforms of functions on 5* (or more precisely 
on Ni/ R (S); the discrepancy between the two explains the factor). The func- 
tion X^T2eT 2 ((3) ^ T2 ^ s similarly comparable to the expression j^(xn 2 (q)d<T2) v for a 
suitable set ^(q). Thus one is faced with an expression of the form 

ll(Xni(,)^i) V (Xn a (,)^2) v ||^ (R » )) (22) 

where we have discarded some powers of R for sake of exposition, as well as the 
localization to q. As observed previously the restriction estimate R*(2 x 2 — > 
2) allows us to bound this quantity by something proportional to |ri' 1 (<7)||fi 2 ((7)|. 
Actually we can do a little better and refine this to, say, |r2' 1 (q)||fi2(<z)| 2 ; this comes 
from not discarding the constraint £2 € ^2 m the argument immediately preceding 
(18), and by exploiting the localization to q more; we omit the details. This is the 
type of bound used in Wolff's argument, in combination with the combinatorial 
arguments controlling the multiplicity of the tubes in Ti and T2 mentioned in 
the previous section, to obtain a sharp bilinear estimate in the case of the cone. 
However, this bound is insufficient for the paraboloid case because of the failure of 
the tubes Ti through a point to lie on a hypcrsurface. 

Fortunately, this can be rectified by exploiting the gain inherent in (18). Indeed, 
by refusing to use (18) one can obtain a bound on (22) which is proportional to 

\Cl' 1 (q)\\n 2 (q)\snp\Q 2 (q)n7r^ v i 4 )\. 

This is similar to the bound of | S~2^ ((7) 1 102(^?) | 2 mentioned earlier, but is a little 
improved because one of the factors of ^2(9) is restricted to a hyperplane. When 
one inserts this bound back into the coarse-scale combinatorial analysis of Wolff, 
this effectively allows us to restrict the tubes T 2 passing through a cube q to be 
incident to a hyperplane. This turns out to be a good substitute for the hypersurface 
localization property used in Wolff's argument, and is the key new ingredient which 
permits us to generalize the bilinear cone estimate to paraboloids (and by similar 
reasoning to other positively curved surfaces, such as the sphere). 

One interesting feature of this argument is that it introduces a non-trivial correla- 
tion between the fine-scale analysis and the coarse-scale analysis; one may speculate 
that future developments on these problems will deal with the fine-scale and coarse- 
scale aspects of the restriction operator in a more unified manner. 
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